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Nucleic Acids, Proteins, and Ar^tibodies 

Statement under 3 7 C.F.R. § 1. 77(b)(4) 

[1] This application refers to a "Sequence Listing'' listed below, which is provided as 

an electronic document on two identical compact discs (CD-R), labeled "Copy 1" and "Copy 
2." These compact discs each contain the following files, which are hereby incorporated in 
their entirety herein: 



Document 


File Name 


Size in bytes 


Date of Creation 


Sequence Listing 


PTZ32_seqList.txt 


3,411,250 


01/15/2001 


V Viewer Setup File 


SetupDLL.exe 


695,808 


12/19/2000 


V Viewer Help File Controller 


v.cnt 


7,984 


01/05/2001 


V Viewer Program File 


v.exe 


753,664 


12/19/2000 


V Viewer Help File 


v.hlp 


447,766 


01/05/2001 



[2] The Sequence Listing may be viewed on an IBM-PC machine running the MS- 

Windows operating system by using the V viewer software, licensed by HGS, Inc., included 
on the compact discs (see World Wid.e Web URL: http://www.fileviewerxom). 



Field of the Invention 

[3] The present invention relates to novel proteins. More specifically, isolated nucleic 

acid molecules are provided encoding novel polypeptides. Novel polypeptides and antibodies 
that bind to these polypeptides are provided. Also provided are vectors, host cells, and 
recombinant and synthetic methods for producing human polynucleotides and/or 
polypeptides, and antibodies. The invention further relates to diagnostic and therapeutic 
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methods useful for diagnosing, treating, preventing and/or prognosing disorders related to 
these novel polypeptides. The invention further relates to screening methods for identifying 
agonists and antagonists of polynucleotides and polypeptides of the invention. The present 
invention further relates to methods and/or compositions for inhibiting or enhancing the 
production and function of the polypeptides of the present invention. 

Background of the Invention 

[4] One of the most critical tasks a cell must perform is to respond to cues from its 

environment, i.e., extracellular, signals. Some of the most important extracellular signals 
come from other cells. The ability for cells to be able to send and receive signals from one 
another is of paramount importance in multicellular organisms because it allows individual 
cells within a body to become highly specialized and yet work in a coordinated fashion with 
other cells of the body. Cellular signaling mechanisms regulate a variety of cellular 
processes such as, for example, proliferation, differentiation, survival, movement, and 
secretion. Defects in cellular signaling can lead to a number of diseases and disorders such 
as cancers, immune system disorders and nervous system disorders. For more expansive 
reviews on this subject, please refer to Hunter, Cell 100:113-127 and Chapter 15 of 
Molecular Biology of the Cell Third Edition, edited by Alberts et al. (1994), which are herein 
incorporated by reference in their entirety. 

[5] Signal transduction requires molecules that serve as the extracellular signaling 

molecules as well as a set of receptors that "receive" the signal. Frequently, an additional set 
of proteins is necessary in order for the cell to translate the signal it has received into an 
appropriate response via the activation or inhibition of a particular set of genes or proteins. 
The signaling molecules, the receptor proteins, and the molecules that relay the signal 
between the receptor and the final effector molecules collectively form what are known as 
signal transduction pathways. 

[6] To date, several common types of signal transduction pathways have been 

identified. One way to classify a signal transduction pathway is based on the class, of 
receptor protein it utilizes. Two well known classes of receptor proteins are G-protein 
coupled receptors and enzyme-linked receptors. This latter class of enzyme-linked receptors 
includes receptor tyrosine kinases, tyrosine kinase associated receptors, receptor 
serine/threonine kinases, receptor tyrosine phosphatases, and receptor guanylyl cyclases. 
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Signal Transduction through G-protein Coupled Receptors 

[7] G protein coupled receptors are the largest family of cell surface receptors. They 

are seven-pass transmembrane receptors which activate trimeric G proteins (G proteins) upon 
ligand binding. G proteins are GTPases composed of three sub units: alpha, beta and gamma. 
G proteins function as molecular switches existing in two states: an active GTP bound state 
and an inactive GDP bound state. Ligand binding to G protein coupled receptors induce 
inactive G proteins to release GDP allowing GTP to bind in its place. Binding of GTP to a G 
protein causes the alpha subunit to dissociate from the beta and gamma subunits which 
remain associated with one an9ther. Eventually, the GTPase activity of the alpha subunit 
results in hydrolysis of the bound GTP molecule to GDP, thus inactivating the G protein. 
^11 [8] There are several types of G proteins that have been classified based upon their 

1=11 function. Stimulatory G proteins (Gg) are involved in adenylate cyclase activation; inhibitory 
j.J: G proteins (Gi) function to inhibit the activity of adenylate cyclase. Yet another type of G 
Cfl protein , Gq proteins, functions in the activation of phosphoinositi de-specific phospho lipase C 
enzyme, 

I - [9] Activation of adenylate cyclase by an activated Gs protein results in the production 

\^ of the cyclic nucleotide, cyclic AMP (cAMP). cAMP mediates its effects mostly through its 
ii^J activation of cAMP dependent kinase (A-kinase), a serine/threonine kinase. Activation of A- 
kinase helps to further relay the signal from the G protein coupled receptor to the target 
proteins. In muscle cells, for instance, activation of A-kinase following adrenaline signaling 
ultimately results in the activation of an enzyme, glycogen phosphorylase, which catalyzes 
the release of glucose molecules which can be used to produce energy from glycogen. In 
other instances, activated A-kinase translocates to the nucleus where it phosphorylates the 
cAMP response element binding (CREB) protein, which when phosphorylated, acts as a 
transcription factor to stimulate the expression of genes that have cAMP response elements 
(CRE) sequences in their regulatory regions, 

[10] Gq proteins, when activated, activate the enzyme phospholipase C-beta which 
hydrolyzes PI 4,5-biphosphate (PIP2) producing inositol triphosphate (IP3) and 
diacylglycerol (DAG). IP3 functions as a second messenger that causes the release of Ca^^ 
from intracellular stores. Released calcium then binds to Ca^"^ binding proteins such as 
calmodulin, which in its calcium bound state, is able to activate Ca^Vcalmodulin dependent 
protein kinases (CaM-kinases). Activated CaM kinases then continue to relay the signal to 
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more downstream molecules in the signal transduction pathway. The other product produced 
by phospholipase C-beta, DAG, functions to activate the serine/threonine kinase known as 
protein kinase C (PKC). Activated PKC phosphorylates target proteins depending on the cell 
type, and in many cells these phosphorylation events lead to the increased transcription of 
specific genes. The highest concentrations of protein kinase C are found in the brain where 
PKC phosphorylates ion channels in nerve cells thereby altering their excitability. PKC 
activation can be induced by treating cells with phorbol esters which are able to cross the 
plasma membrane, bind to, and activate PKC directly. 

Signal Transduction through Receptor Tyrosine Kinases 

[11] The receptor protein tyrosine kinases (RPTKs) are some of the most well studied 
receptors, and the signaling cascades they initiate demonstrate two of the fundamental 
concepts in signal transduction: the regulation of protein phosphorylation and the recruitment 
of proteins into a signaling cascade via protein-protein interaction domains. 
[12] Binding of the cognate ligand to a RPTK, such as epidermal growth factor (EGF) 
binding to the epidermal growth factor receptor (EGFR), induces RPTKs to dimerize and 
cross-phosphorylate each other on multiple tyrosine residues. The phosphorylated receptor 
dimer is the activated form of the receptor. 

[13] The phosphorylated tyrosines on activated RPTKs are then recognized/bound by 
other components of the signal transduction pathway. One of the important discoveries in the 
field of signal transduction was the recognition of conserved domains which allow for 
protein-protein interactions in signaling pathways. The most prevalent binding domain that 
recognizes phosphotyrosine (P-Tyr) residues is known as the SH2 domain (for Src homology 
region 2, named after the Src protein in which the SH2 domain was first discovered). 
Another domain that recognizes P-Tyr residues is called the P-Tyr binding domain (PTB). 
The discovery of the SH2 domain was quickly followed by the discovery of several other 
protein-protein interaction domains involved in signal transduction and by the realization that 
most of these domains are modular in nature, meaning these domains fold independently - a 
most convenient feature for protein engineering. To date, more than 100 such protein 
interaction domains involved in signaling have been defined via comparative sequence 
analysis. Most of these domains recognize short linear sequences (approximately 4-10 amino 
acid residues in length), in some cases requiring phosphorylation of specific residues within 
the sequence allowing for inducible association. A convenient web based database, with 
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links to abstracts of papers characterizing these domains can be found at http ://sniart. EM B L - 
Heidelbere.de . 

[14] Proteins containing SH2 and PTB domains translocate to the plasma membrane 
where they associate with the activated RPTKs which, in turn, activates them through 
phosphorylation. By way of example, activation of the platelet derived growth factor 
receptor (PDGFR) results in the autophosphorylation of tyrosine residues in the cytoplasmic 
tail of the PDGFR. These P-Tyr residues then serve as the binding sites for other proteins, 
such as a GTPase (discussed in more detail below), phospholipase C-gamma, and the 
regulatory subunit of PI-3-kinase, which are each able to recognize the P-Tyr residues in 
PDGFR via SH2 domains. The interaction of these proteins with the activated PDGFR 
results in the translocation of these proteins to the plasma membranes where they have their 
% substrates and the PDGFR mediated activation of these proteins via phosphorylation. 

[15] In the previous example, each of the proteins recruited to the activated RPTK via 
^ their SH2 domains also had catalytic activities that allowed them to propagate a signal. 
;ii There are proteins involved in signal transduction, however, which have no ability in and of 
themselves to propagate a signal. Instead, these proteins, known as adaptor proteins, serve 
-to couple activated RPTKs to other components of the signal transduction pathway which do 
have the capacity to propagate the signal. One such adaptor protein is known as Grb2. It 
;Jf contains one SH2 domain and two SH3 domains (another Src homology domain that 
mediates protein interactions). Grb 2 is constitutively associated with Sos protein, a guanine 
nucleotide releasing protein (GNRP), via its SH3 domain. Thus, when Grb2 associates with 
an activated receptor via its SH2 domain, it also brings Sos into proximity with the RPTK 
which activates the Sos protein via phosphorylation. 
^ [16] GNRP proteins, such as Sos, are one of two types of proteins that help regulate the 
activity of proteins belonging to the Ras superfamily of monomeric GTPases. Ras proteins 
are proteins that are associated with the cytoplasmic side of the plasma membrane and help 
relay signals from RPTK to the nucleus to stimulate cell proliferation or differentiation. Ras 
proteins exist in two states, an inactive state in which ras is bound to GDP and an active state 
in which ras is bound to GTP. Activated GNRP proteins promote the exchange of bound 
GDP for GTP on ras proteins, thereby activating ras. Ras, itself, is a GTPase that hydro lyzes 
GTP to GDP, and would therefore tend to inactivate itself over time. However, ras is an 
inefficient GTPase, so the inactivation of ras is enhanced by GTPase activating proteins 
(GAPs) which increase the rate of hydrolysis of GTP by ras. 
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[17] Activated Ras kinases then act to activate more downstream signaling events, 
including activation of the mitogen-activated protein kinase (MAPK) pathway which is a 
cascade of serine/ threonine kinases. Ras binds to and activates a MAPK kinase kinase 
(MAPKKK, such as Raf-1, for example), which in turn activates a MAPK kinase (MAPKK) 
via phosphorylation, which in turn activates a MAPK. MAPKs relay signals downstream by 
phosphorylating various proteins in the cell including other kinases and/or regulatory proteins 
in the cell. For instance, an activated MAPK can enter the nucleus and help to initiate 
transcription of genes that must be expressed in order for the cell to respond to the 
extracellular signal, such as genes required for DNA replication in response to the 
extracellular proliferation signal. 

[18] Another class of signaling receptors, receptor serine/threonine kinases (RSK) has 
recently been identified. An example of an RSK is the TGF-beta receptor. Additionally, it 

= 

has also been recently recognized that there are modular binding domains that recognize 
|: phosphoserine/phosphothreonine (P-Ser/P-Thr) residues. For instance, 14-3-3 domains 
;JJ recognize phosphoserines in specific amino acid contexts [RSX(P-Ser)XP] or [R(Y/F)X(P- 
Ser)XP] and may function in the assembly of signaling complexes. Other residues such as 
histidine and arginine can also be phosphorylated, and it is possible that additional kinases 
^ which phosphorylate these residues, or protein domains that bind phosphohistidine or 
M phosphoarginine will be discovered. 

Signaling Via Intracellular Receptors 

[19] Some extracellular signals do not have cell surface receptors such as G protein 

coupled receptors or receptor tyrosine kinases. Instead, these extracellular signals are able to 
traverse the plasma membrane and interact with their receptors in the cytoplasm. Examples 
of such signals are the steroid hormones and the gas nitrous oxide (NO). The steroid 
hormone receptors, once bound by their ligand, are generally able to translocate to the 
nucleus where they bind regulatory DNA elements that control the gene expression of 
specific genes. NO gas, on the other hand, generally enters a cell and reacts with iron in the 
active site of the enzyme guanylate cyclase, stimulating it to produce cyclic GMP (cGMP). 
cGMP acts as a second messenger (similar to the way cAMP functions) and can stimulate 
further downstream signaling by binding to other proteins. 
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Terminating Signal Transduction 

[20] As the effects of signal transduction are transient, there must also be mechanisms 
for terminating signal cascades. For example, G proteins are self-inactivating, and there are a 
set of proteins, GAPs, that are devoted to increasing the rate of hydrolysis of bound GTP by 
ras proteins. Cyclic nucleotide second messngers such as cAMP and cGMP are hydrolyzed 
by phosphodiesterases. In the case of kinases, there generally exist a set of complementary 
phosphatases that function to dephosphorylate phosphorylated residues, thereby bringing the 
signaling event to a close. 

Signal Transduction Pathway Components and Disease 

[21] Because signal transduction is involved in the regulation of so many cellular 
processes, including proliferation, differentiation, survival, and apoptosis, it is not surprising 
that defects in cellular signal transduction pathway components lead to a number of diseases 
and disorders, especially cancers. For a review on Signal transduction pathway components 
and diseases, see Hunter, Philosophical Transactions of the Royal Society of London Series 
B 353:583-605 (1998) which is herein incorporated by reference in its entirety. For instance, 
approximately 30% of human cancers have mutations in a ras gene, and at least 18 tyrosine 
kinases have been identified as oncogenes in either acutely transforming retroviruses or in 
human tumors, such as for example, Src. And more than 95% of chronic myelogenous 
leukemias express an activated form of the c-Abl non-receptor tyrosine kinases. 
[22] Mutations in signaling pathways are also implicated in a plethora of other diseases. 
Mutation in Bruton's tyrosine kinase leads to X-linked aganunaglobulinemia. Inactivation of 
ZAP70 or JAK3 leads to a severe combined immunodeficiency disease. Coffin-Lowry 
syndrome occurs when the X-linked Rsk2 protein serine kinase gene is inactivated. 
Myotonic dystrophy occurs when expression of the myotonic dystrophy serine kinase gene is 
decreased. Overexpression of the aurora2 serine kinase is implicated in colon carcinoma. 
[23] The malfunction of signal transduction pathway components, particularly kinases, 
in diseases indicate that these genes are good targets for drugs/pharmaceuticals that either 
inhibit or activate their function. In fact, some such drugs have been developed and are 
already in use or in clinical trials. For instance, an inhibitor of cyclin dependent kinase 2 
(cdk2), a kinase important in regulating cellular proliferation, is in clinical trials for cancer 
treatment, as are inhibitors of epidermal growth factor receptor tyrosine kinases and vascular 
endothelial growth factor receptor (VEGFR) tyrosine kinases. Inhibition of VEGFR activity 
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reduces or eliminates the vascularization of tumors directed by VEGFR. An antagonistic 
monoclonal antibody, herceptin, against the erbB2 receptor tyrosine kinase is being used in 
breast cancer therapies to treat breast cancers where ErbB2 is overexpressed, 
[24] Thus there exists a clear need for identifying and exploiting novel signal 
transduction pathway component polynucleotides and polypeptides. Although structurally 
related, such proteins may possess diverse and multifaceted functions in a variety of cell and 
tissue types. The inventive purified signal transduction pathway component polypeptides are 
research tools useful for the identification, characterization and purification of additional 
proteins involved in signal transduction. Furthermore, the identification of new signal 
transduction pathway component polynucleotides and polypeptides permits the development 
of a range of derivatives, agonists and antagonists at the nucleic acid and protein levels which 
in turn have applications in the treatment and diagnosis of a range of conditions such as, for 
example, cancer and other proliferative disorders (e.g., chronic myelogenous leukemia), 
immunological disorders (e.g., severe combined immunodeficiency and X-linked 
agammaglobulinemia), and nervous system disorders (Coffin-Lowry Syndrome), amongst 
other conditions. 

Summary of the Invention 
[25] The present invention relates to novel proteins. More specifically, isolated nucleic 
acid molecules are provided encoding novel polypeptides. Novel polypeptides and antibodies 
that bind to these polypeptides are provided. Also provided are vectors, host cells, and 
recombinant and synthetic methods for producing human polynucleotides and/or 
polypeptides, and antibodies. The invention ftirther relates to diagnostic and therapeutic 
methods useful for diagnosing, treating, preventing and/or prognosing disorders related to 
these novel polypeptides. The invention fiirther relates. to screening methods for identifying 
agonists and antagonists of polynucleotides and polypeptides of the invention. The present 
invention further relates to methods and/or compositions for inhibiting or enhancing the 
production and fianction of the polypeptides of the present invention. 

Detailed Description 

Tables 
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[26] Table lA summarizes some of the polynucleotides encompassed by the invention 
(including cDNA clones related to the sequences (Clone ID NO:Z), contig sequences (contig 
identifier (Contig ID:) and contig nucleotide sequence identifier (SEQ ID NO:X)) and further 
summarizes certain characteristics of these polynucleotides and the polypeptides encoded 
thereby. The first column provides the gene number in the application for each clone 
identifier. The second column provides a unique clone identifier, "Clone ID NO:Z", for a 
cDNA clone related to each contig sequence disclosed in Table lA. The third column 
provides a unique contig identifier, "Contig ID:" for each of the contig sequences disclosed 
in Table lA. The fourth column provides the sequence identifier, "SEQ ID NO:X", for each 
of the contig sequences disclosed in Table lA. The fifth colunm, "ORE (From-To)", 
provides the location (i.e., nucleotide position numbers) within the polynucleotide sequence 

?l of SEQ ID NO:X that delineate the preferred open reading frame (ORE) that encodes the 

^ amino acid sequence shown in the sequence listing and referenced in Table lA as SEQ ID 
NO:Y (column 6). Column 7 lists residues comprising predicted epitopes contained in the 

|1 polypeptides encoded by each of the preferred ORFs (SEQ ID NO:Y). Identification of 
potential immunogenic regions was performed according to the method of Jameson and Wolf 
(CABIOS, 4; 181-186 (1988)); specifically, the Genetics Computer Group (GCG) 

..L implementafion of this algorithm, embodied in the program PEPTIDE STRUCTURE 
(Wisconsin Package vlO.O, Genetics Computer Group (GCG), Madison, Wise). This method 

--^ returns a measure of the probability that a given residue is found on the surface of the protein. 
Regions where the antigenic index score is greater than 0.9 over at least 6 amino acids are 
indicated in Table lA as "Predicted Epitopes". In particular embodiments, polypeptides of 
the invention comprise, or alternatively consist of, one, two, three, four, five or more of the 
predicted epitopes described in Table lA. It will be appreciated that depending on the 
analytical criteria used to predict antigenic determinants, the exact address of the determinant 
may vary slightly. Column 8, "Tissue Distribution" shows the expression profile of tissue, 
cells, and/or cell line libraries which express the polynucleotides of the invention. The first 
number in column 8 (preceding the colon), represents the tissue/cell source identifier code 
corresponding to the key provided in Table 4. Expression of these polynucleotides- was not 
observed in the other tissues and/or cell libraries tested. For those identifier codes in which 
the first two letters are not "AR", the second number in column 8 (following the colon), 
represents the number of times a sequence corresponding to the reference polynucleotide 
sequence (e.g., SEQ ID NO:X) was identified in the tissue/cell source. Those tissue/cell 
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source identifier codes in which the first two letters are ''AR" designate information 
generated using DNA array technology. Utihzing this technology, cDNAs were amplified by 
PGR and then transferred, in duplicate, onto the array. Gene expression was assayed through 
hybridization of first strand cDNA probes to the DNA array. cDNA probes were generated 
from total RNA extracted from a variety of different tissues and cell lines. Probe synthesis 
was performed in the presence of ^^P dGTP, using oligo(dT) to prime reverse transcription. 
After hybridization, high stringency washing conditions were employed to remove non- 
specific hybrids from the array. The remaining signal, emanating from each gene target, was 
measured using a Phosphorimager. Gene expression was reported as Phosphor Stimulating 
Luminescence (PSL) which reflects the level of phosphor signal generated from the probe 
hybridized to each of the gene targets represented on the array. A local background signal 
subtraction was performed before the total signal generated from each array was used to 
normalize gene expression between the different hybridizations. The value presented after 
"[array code]:" represents the mean of the duplicate values, following background subtraction 
and probe noraialization. One of skill in the art could routinely use this information to 
identify normal and/or diseased tissue(s) which show a predominant expression pattern of the 
corresponding polynucleotide of the invention or to identify polynucleotides which show 
predominant and/or specific tissue and/or cell expression. Column 9 provides the 
chromosomal location of polynucleotides corresponding to SEQ ID NO:X. Chromosomal 
location was determined by finding exact matches to EST and cDNA sequences contained in 
the NCBI (National Center for Biotechnology Information) UniGene database. Given a 
presumptive chromosomal location, disease locus association was determined by comparison 
with the Morbid Map, derived from Online Mendelian Inheritance in Man (Online Mendehan 
Inheritance in Man, OMIM™. McKusick-Nathans Institute for Genetic Medicine, Johns 
Hopkins University (Baltimore, MD) and National Center for Biotechnology Information, 
National Library of Medicine (Bethesda, MD) 2000.. World Wide Web URL: 
http://www.ncbi.nlm.nih.gov/omim/). If the putative chromosomal location of the Query 
overlaps with the chromosomal location of a Morbid Map entry, an OMIM identification 
number is disclosed in column 10 labeled "OMIM Disease Reference(s)". A key to the 
OMIM reference identification numbers is provided in Table 5. 

[27] Table IB summarizes additional polynucleotides encompassed by the invention 
(including cDNA clones related to the sequences (Clone ID NO:Z), contig sequences (contig 
identifier (Contig ID:) contig nucleotide sequence identifiers (SEQ ID NO:X)), and genomic 
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sequences (SEQ ID NO:B). The first column provides a unique clone identifier, "Clone ID 
NO:Z", for a cDNA clone related to each contig sequence. The second column provides the 
sequence identifier, "SEQ ID NO:X", for each contig sequence. The third column provides a 
unique contig identifier, "Contig ID:" for each contig sequence. The fourth column, provides 
a BAG identifier "BAG ID NO:A" for the BAG clone referenced in the corresponding row of 
the table. The fifth column provides the nucleotide sequence identifier, "SEQ ID NO:B" for 
a fragment of the BAC clone identified in column four of the corresponding row of the table. 
The sixth column, "Exon From-To", provides the location (i.e., nucleotide position numbers) 
within the polynucleotide sequence of SEQ ID NO:B which delineate certain polynucleotides 
of the invention that are also exemplary members of polynucleotide sequences that encode 
polypeptides of the invention (e.g., polypeptides containing amino acid sequences encoded 
by the polynucleotide sequences delineated in column six, and fragments and variants 
thereof). 

[28] Table 2 summarizes homology and features of some of the polypeptides of the 
invention. The first column provides a unique clone identifier, "Clone ID NO:Z", 
corresponding to a cDNA clone disclosed in Table lA. The second column provides the 
unique contig identifier, "Contig ID:" corresponding to contigs in Table 1 A and allowing for 
correlation with the information in Table lA. The third column provides the sequence 
identifier, "SEQ ID NO:X", for the contig polynucleotide sequence. The fourth column 
provides the analysis method by which the homology/identity disclosed in the Table was 
determined. Comparisons were made between polypeptides encoded by the polynucleotides 
of the invention and either a non-redundant protein database (herein referred to as "NR"), or 
a database of protein families (herein referred to as "PFAM") as further described below. 
The fifth column provides a description of the PFAM/NR hit having a significant match to a 
polypeptide of the invention. Column six provides the accession number of the PFAM/NR 
hit disclosed in the fifth column'. Column seven, "Score/Percent Identity", provides a quality 
score or the percent identity, of the hit disclosed in columns five and six. Columns 8 and 9, 
"NT From" and "NT To" respectively, delineate the polynucleotides in "SEQ ID NO:X" that 
encode a polypeptide having a significant match" to the PFAM/NR database as disclosed in 
the fifth and sixth columns. In specific embodiments polypeptides of the invention comprise, 
or alternatively consist of, an amino acid sequence encoded by a polynucleotide in SEQ ID 
NO:X as delineated in columns 8 and 9, or fragments or variants thereof 
[29] Table 3 provides polynucleotide sequences that may be disclaimed according to 



11 



certain embodiments of the invention. The first column provides a unique clone identifier, 
"Clone ID", for a cDNA clone related to contig sequences disclosed in Table- 1 A. The 
second column provides the sequence identifier, "SEQ ID NO:X", for contig sequences 
disclosed in Table lA. The third column provides the unique contig identifier, "Contig ID:", 
for contigs disclosed in Table I A. The fourth column provides a unique integer 'a' where 'a' 
is any integer between I and the final nucleotide minus 15 of SEQ ID NO:X, and the fifth 
column provides a unique integer 'b' where 'b' is any integer between 15 and the final 
nucleotide of SEQ ID NO:X, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:X, and where b is greater than or equal to a + 14. For each of 
the polynucleotides shown as SEQ ID NO:X, the uniquely defined integers can be substituted 
into the general formula of a-b, and used to describe polynucleotides which may be 
n preferably excluded from the invention. In certain embodiments, preferably excluded from 
the invention are at least one, two, three, four, five, ten, or more of the polynucleotide 
sequence(s) having the accession number(s) disclosed in the sixth column of this Table 
!fl (including for example, published sequence in connection with a particular BAC clone). In 
]" further embodiments, preferably excluded from the invention are the specific polynucleotide 
sequence(s) contained in the clones corresponding to at least one, two, three, four, five, ten, 
or more of the available material having the accession numbers identified in the sixth column 
of this Table (including for example, the actual sequence contained in an identified BAC 
clone). 

[30] Table 4 provides a key to the tissue/cell source identifier code disclosed in Table 
lA, column 8. Column 1 provides the tissue/cell source identifier code disclosed in Table 1 A, 
Column 8. Columns 2-5 provide a description of the tissue or cell source. Codes 
corresponding to diseased tissues are indicated in column 6 with the word "disease". The use 
of the word "disease" in column 6 is non-limiting. The tissue or cell source may be specific 
(e.g. a neoplasm), or may be disease-associated (e.g., a tissue sample from a normal portion 
of a diseased organ). Furthermore, tissues and/or cells lacking the "disease" designation may 
still be derived from sources directly or indirectly involved in a disease state or disorder, and 
therefore may have a further utility in that disease state or disorder. In numerous cases where 
the tissue/cell source is a library, column 7 identifies the vector used to generate the library. 
[31] Table 5 provides a key to the OMIM reference identification numbers disclosed in 
Table lA, column 10. OMIM reference identification numbers (Column 1) were derived 
from Online Mendelian Inheritance in Man (Online Mendelian Inheritance in Man, OMIM. 
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McKusick-Nathans Institute for Genetic Medicine, Johns Hopkins University (Baltimore, 
MD) and National Center for Biotechnology Information, National Library of Medicine, 
(Bethesda, MD) 2000. World Wide Web URL: http://www.ncbi.nlm.nih.gov/omim/). 
Column 2 provides diseases associated with the cytologic band disclosed in Table lA, 
column 9, as determined using the Morbid Map database. 

[32] Table 6 summarizes ATCC Deposits, Deposit dates, and ATCC designation 
numbers of deposits made with the ATCC in connection with the present application. 
[33] Table 7 shows the cDNA libraries sequenced, and ATCC designation numbers and 
vector information relating to these cDNA libraries. 

[34] Table 8 provides a . physical characterization of clones encompassed by the 
i^^-, invention. The first column provides the unique clone identifier, "Clone ID NO:Z", for 

certain cDNA clones of the invention, as described in Table 1 A. The second column provides 
Hp the size of the cDNA insert contained in the corresponding cDNA clone. 

Definitions 

L.b [35] The following definitions are provided to facilitate understanding of certain terms 
f ^ used throughout this specification. 

iij [36] In the present invention, "isolated" refers to material removed from its original 
environment (e.g., the natural environment if it is naturally occurring), and thus is altered "by 
the hand of man" from its natural state. For example, an isolated polynucleotide could be 
part of a vector or a composition of matter, or could be contained within a cell, and still be 
"isolated" because that vector, composition of matter, or particular cell is not the original 
environment of the polynucleotide. The term "isolated" does not refer to genomic or cDNA 
libraries, whole cell total or mRNA preparations, genomic DNA preparations (including 
those separated by electrophoresis and transferred onto blots), sheared whole cell genomic 
DNA preparations or other compositions where the art demonstrates no distinguishing 
features of the polynucleotide/sequences of the present invention. 

[37] As used herein, a "polynucleotide" refers to a molecule having a nucleic acid 
sequence encoding SEQ ID NO:Y or a firagment or variant thereof; a nucleic acid sequence 
contained in SEQ ID NO:X (as described in column 3 of Table lA) or the complement 
thereof; a cDNA sequence contained in Clone ID NO:Z (as described in column 2 of Table 
1 A and contained within a library deposited with the ATCC); a nucleotide sequence encoding 
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the polypeptide encoded by a nucleotide sequence in SEQ ID NO:B as defined in column 6 
of Table IB or a fragment or variant thereof; or a nucleotide coding sequence in SEQ ID 
NO:B as defined in column 6 of Table IB or the complement thereof. For example, the 
polynucleotide can contain the nucleotide sequence of the full length cDNx\ sequence, 
including the 5' and 3' untranslated sequences, the coding region, as well as fragments, 
epitopes, domains, and variants of the nucleic acid sequence. Moreover, as used herein, a 
"polypeptide" refers to 'a molecule having an amino acid sequence encoded by a 
polynucleotide of the invention as broadly defined (obviously excluding poly-Phenylalanine 
or poly-Lysine peptide sequences which result from translation of a polyA tail of a sequence 
corresponding to a cDNA). 

[38] In the present invention, "SEQ ID NO:X" was often generated by overlapping 
sequences contained in multiple clones (contig analysis). A representative clone containing 
all or most of the sequence for SEQ ID NO:X is deposited at Human Genome Sciences, Inc. 
(HGS) in a catalogued 'and archived library. As shown, for example, in column 2 of Table 
lA, each clone is idenfified by a cDNA Clone ID (identifier generally referred to herein as 
Clone ID NO:Z). Each Clone ID is unique to an individuaLclone and the Clone ID is all the 
information needed to retrieve a given clone from the HGS library. Furthermore, certain 
clones disclosed in this application have been deposited with the ATCC on October 5, 2000, 
having the ATCC designation numbers PTA 2574 and PTA 2575; and on January 5, 2001, 
having the depositor reference numbers TS-1, TS-2, AC-1, and AC-2. In addition to the 
individual cDNA clone deposits, most of the cDNA libraries from which the clones were 
derived were deposited at the American Type Culture Collection (hereinafter "ATCC"). 
Table 7 provides a Hst of the deposited cDNA libraries. One can use the Clone ID NO:Z to 
determine the library source by reference to Tables 6 and 7. Table 7 lists the deposited 
cDNA libraries by name and links each library to an ATCC Deposit. Library names contain 
four characters, for example, "HTWE." The name of a cDNA clone (Clone ID) isolated from 
that library begins with the same four characters, for example "HTWEP07". As mentioned 
below. Table lA correlates the Clone ID names with SEQ ID NO:X. Thus, starting with an 
SEQ ID NO:X, one can use Tables 1, 6 and 7 to determine the corresponding Clone ID, 
which library it came from and which ATCC deposit the library is contained in. Furthermore, 
it is possible to retrieve a given cDNA clone from the source library by techniques known in 
the art and described elsewhere herein. The ATCC is located at 10801 University Boulevard, 
Manassas, Virginia 201 10-2209, USA. The ATCC deposits were made pursuant to the terms 
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of the Budapest Treaty on the international recognition of the deposit of microorganisms for 
the purposes of patent procedure. 

[39] In specific embodiments, the polynucleotides of the invention are at least 15, at 
least 30, at least 50, at least 100, at least 125, at least 500, or at least 1000 continuous 
nucleotides but are less than or equal to 300 kb, 200 kb, 100 kb, 50 kb, 15 kb, 10 kb, 7.5kb, 5 
kb, 2.5 kb, 2.0 kb, or 1 kb, in length. In a further embodiment, polynucleotides of the 
invention comprise a portion of the coding sequences, as disclosed herein, but do not 
comprise all or a portion of any intron. In another embodiment, the polynucleotides 
comprising coding sequences do not contain coding sequences of a genomic flanking gene 
(i.e., 5' or 3' to the gene of interest in the genome). In other embodiments, the 
polynucleotides of the invention do not contain the coding sequence of more than 1000, 500, 
250, 100, 50, 25, 20, 15, 10, 5, 4, 3, 2, or 1 genomic flanking gene(s). 

[40] A "polynucleotide" of the present invention also includes those polynucleotides 
capable of hybridizing, under stringent hybridization conditions, to sequences contained in 
SEQ ID NO:X, or the complement thereof (e.g., the complement of any one, two, three, four, 
or more of the polynucleotide fragments described herein), the polynucleotide sequence 
delineated in columns 8 and 9 of Table 2 or the complement thereof, and/or cDNA sequences 
contained in Clone ID NO:Z (e.g., the complement of any one, two, three, four, or more of 
the polynucleotide fragments, or the cDNA clone within the pool of cDNA clones deposited 
with the ATCC, described herein), and/or the polynucleotide sequence delineated in column 
6 of Table IB or the complement thereof. "Stringent hybridization conditions" refers to an 
overnight incubation at 42 degree C in a solution comprising 50% formamide, 5x,SSC (750 
^mM NaCl, 75 mM trisodium citrate), 50 mM sodium phosphate (pH 7.6), 5x Denhardt's 
solution, 10% dextran sulfate, and 20 iiig/ml denatured, sheared salmon sperm DNA, 
followed by washing the filters in O.lx SSC at about 65 degree C. 

[41] Also contemplated are nucleic acid molecules that hybridize to the polynucleotides 
of the present invention at lower stringency hybridization conditions. Changes in the 
stringency of hybridization and signal detection are primarily accomplished through the 
manipulation of formamide concentration (lower percentages of formamide result in lowered 
stringency); salt conditions, or temperature. For example, lower stringency conditions 
include an overnight incubation at 37 degree C in a solution comprising 6X SSPE (20X SSPE 
= 3M NaCl; 0.2M NaH2P04; 0.02M EDTA, pH 7.4), 0.5% SDS, 30% formamide, 100 ug/ml 
salmon sperm blocking DNA; followed by washes at 50 degree C with IXSSPE, 0.1% SDS. 
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In addition, to achieve even lower stringency, washes performed following stringent 
hybridization can be done at higher salt concentrations (e.g. 5X SSC). 

[42] Note that variations in the above conditions may be accomplished through the 
inclusion and/or substitution of alternate blocking reagents used to suppress background in 
hybridization experiments. Typical blocking reagents include Denhardt's reagent, BLOTTO, 
heparin, denatured salmon sperm DNA, and commercially available proprietary formulations. 
The inclusion of specific blocking reagents may require modification of the hybridization 
conditions described above, due to problems with compatibility. 

[43] Of course, a polynucleotide which hybridizes only to polyA4- sequences (such as 

any 3' terminal polyA+ tract of a cDNA shown in the sequence listing), or to a 
complementary stretch of T (or U) residues, would not be included in the definition of 
"polynucleotide," since such a polynucleotide would hybridize to any nucleic acid molecule 
containing a poly (A) stretch or the complement thereof (e.g., practically any double-stranded 
cDNA clone generated using oligo dT as a primer). 

[44] The polynucleotide of the present invention can be composed of any 
polyribonucleotide or polydeoxribonucleotide, which may be unmodified RNA or DNA or 
modified RNA or DNA. For example, polynucleotides can be composed of single- and 
double-stranded DNA, DNA that is a mixture of single- and double-stranded regions, single- 
and double-stranded RNA, and RNA that is mixture of single- and double-stranded regions, 
hybrid molecules comprising DNA and RNA that may be single-stranded or, more typically, 
double-stranded or a mixture of single- and double-stranded regions. In addition, the 
polynucleotide can be composed of triple-stranded regions comprising RNA or DNA or both 
RNA and DNA. A polynucleotide may also contain one or more modified bases or DNA or 
RNA backbones modified for stability or for other reasons. "Modified" bases include, for 
example, tritylated bases and unusual bases such as inosine. A variety of modifications can 
be made to DNA and RNA; thus, "polynucleotide" embraces chemically, enzymatically, or 
metabolically modified forms. 

[45] The polypeptide of the present invention can be composed of amino acids joined to 
each other by peptide bonds or modified peptide bonds, i.e., peptide isosteres, and may 
contain amino acids other than the 20 gene-encoded amino acids. The polypeptides may be 
modified by either natural processes, such as posttranslational processing, or by chemical 
modification techniques which are well known in the art. Such modifications are well 
described in basic texts and in more detailed monographs, as well as in a voluminous 
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research literature. Modifications can occur anywhere in a polypeptide, including the peptide 
backbone, the amino acid side-chains and the amino or carboxyl termini. It will be 
appreciated that the same type of modification may be present in the same or varying degrees 
at several sites in a given polypeptide. Also, a given polypeptide may contain many types of 
modifications. Polypeptides may be branched, for example, as a result of ubiquitination, and 
they may be cyclic, with or without branching. Cyclic, branched, and branched cyclic 
polypeptides may result jfrom posttranslation natural processes or may be made by synthetic 
methods. Modifications include acetylation, acylation, ADP-ribosylation, amidation, 
covalent attachment of flavin, covalent attachment of a heme moiety, covalent attachment of 
a nucleotide or nucleotide derivative, covalent attachment of a lipid or lipid derivative, 
covalent attachment of phosphotidylinositol, cross-linking, cyclization, disulfide bond 
formation, demethylation, formation of covalent cross-links, formation of cysteine, formation 
of pyroglutamate, formylation, gamma-carboxylation, glycosylation, GPI anchor formation, 
hydroxylation, iodination, methylation, myristoylation, oxidation, pegylation, proteolytic 
processing, phosphorylation, prenylation, racemization, selenoylation, sulfation, transfer- 
RNA mediated addition of amino acids to proteins such as arginylation, and ubiquitination. 
(See, for instance, PROTEINS - STRUCTURE AND MOLECULAR PROPERTIES, 2nd 
Ed., T. E. Creighton, W. H. Freeman and Company, New York (1993); 
POSTTRANSLATIONAL COVALENT MODIFICATION OF PROTEINS, B. C. Johnson, 
Ed., Academic Press, New York, pgs. 1-12 (1983); Seifter et al., Meth. Enzymol. 182:626- 
'646 (1990); Rattan et al., Ann. N.Y. Acad. Sci. 663:48-62 (1992)). 

[46] "SEQ ID NO:X" refers to a polynucleotide sequence described, for example, in 
Tables 1 Aor 2, while "SEQ ID NO:Y" refers to a polypeptide sequence described in column 
6 of Table lA. SEQ ID NO:X is identified by an integer specified in column 4 of Table I A. 
The polypeptide sequence SEQ ID NO:Y is a translated open reading frame (ORF) encoded 
by polynucleotide SEQ ID NO:X. "Clone ID NO:Z" refers to a cDNA clone described in 
column 2 of Table lA. 

[47] "A polypeptide having functional activity" refers to a polypeptide capable of 
displaying one or more known functional activities associated with a full-length (complete) 
protein. Such functional activities include, but are not limited to, biological activity, 
antigenicity [ability to bind (or compete with a polypeptide for binding) to an anti- 
polypeptide antibody], immunogenicity (ability to generate antibody which binds to a 
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specific polypeptide of the invention), ability to form multimers with polypeptides of the 
invention, and ability to bind to a receptor or ligand for a polypeptide. 

[48] The polypeptides of the invention can be assayed for functional activity (e.g. 
biological activity) using or routinely modifying assays known in the art, as well as assays 
described herein. Specifically, one of skill in the art may routinely assay signal transduction 
pathway component polypeptides (including fragments and variants) of the invention for 
activity using assays as described in Examples 38, 39, 49, 52-57, 64 and 67. 
[49] "A polypeptide having biological activity" refers to a polypeptide exhibiting 
activity similar to, but not necessarily identical to, an activity of a polypeptide of the present 
invention, including mature forms, as measured in a particular biological assay, with or 
without dose dependency. In the case where dose dependency does exist, it need not be 
identical to that of the polypeptide, but rather substantially similar to the dose-dependence in 
a given activity as compared to the polypeptide of the present invention (i.e., the candidate 
polypeptide will exhibit greater activity or not more than about 25-fold less and, preferably, 
not more than about tenfold less activity, and most preferably, not more than about three-fold 
less activity relative to the polypeptide of the present invention). 

[50] Table lA summarizes some of the polynucleotides encompassed by the invention 
(including contig sequences (SEQ ID NO:X) and clones (Clone ID NO:Z) and further 
summarizes certain characteristics of these polynucleotides and the polypeptides encoded 
thereby. 
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AR061: 0, AR089: 0 
S0126:2, S0046: 1, 
H0645: 1,H0550: 1 and 
H0135: 1. 


AR050; 204, AR054: 
168,AR051: 151, 
AR089: 9, AR061: 6 
S0140: 2, L0783: 2, 
S0150: 1.L0769: 1. 
L0774: 1,L0775: 1, 


Pro-473 to Asp-478, 
Leii-576 to Lys-585. 


Tyr-114toTrp-119, 
Gln-124 to Ile-129. 


Asp-1 to Arg-7, 
Glii-19 to Leu-32, 
Leu-36 to Ser-49, 
Ser-74 to Pro- 100, 
Ser-113toVa]-130, 
Thr-143 to His-154, 
Gln-161 to Arg-167, 
Val-194 to Phe-200. 


Asp-1 to Arg-7, 
Glu-19 to Leu-32, 
Leu-36 to Ser-49, 
Ser-74 to Pro- 100, 
Ser-113toVai-130, 
Thr-143 to His-154. 
Gln-161 to Arg-167, 
Val-194 to Phe-200. 


Cys-52 toTrp-57, 
Pro-69 to Asp-74, 
Glu-95 to Ser-115, 
Pro- 136 to Gly-143. 
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L0456: 1.H0316: 1, 
H0040: 1,H0264: 1, 
H0623: 1.H0494: 1, 
S0016: 1,S0210: 1, 
L0761: 1,L0771; 1, 
L0650: 1.L0774: 1. 
r n'<7S- 1 T 0784- 1 

L0776: 1.L0655: I, 
H0547: 1,H0659: 1, 
H0670: 1,H0672: 1. 
H0696: 1,S0037: 1, 
S0028: 1,S0032: 1, 
L0744: 1,L0779: 1. 
L0752: l,L0753: 1, 
S0031: 1 andL0366: 1. 


AR061: 3,AR089: 1 
S0222: 1.S6014: 1, 
SOOlO: 1,S6028: 1 and 
S0036: 1. 




Pro-52 to Asp-57, 
Asp-67 to Trp-72, 
Lys-87 to Gly-92, 
Asp-98 to Gly-104. 


Gly-36 to Asp-42, 
Pro-51 to Ala-56, 
Gin-84 to Leu-91, 
His- 105 to His- 112. 
Tyr-115 to Pro- 124, 
Pro-155 to Ser-162. 
Lys-lo / to Aia-1 / J, 
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L0803: 1.L0651: 1, 
L0791: 1,H0539: 1, 
H0521: ],L0780: 1, 
L0753: 1,L0758: 1 and 
SO 192: 1. 




AR089: 4,AR061: 2 
H0551:2.H0581: 1. 
H0560: 1,H0414: 1, 
S0152: 1 and H0522: 1. 


^R061: 1,AR089: 0 
S0052: 2, S0028: 2, 
TO624: 1,H0294: 1, 
50001: 1.S0282: 1, 
-10250: 1,H0271: 1, 




Tyr-7 toTyr-15, 
Pro-43 to Ala-52, 
Gln-57 to Ala-62, 
Asn-68 to Ala-73, 
Tyr-75 to Met-83. 
Glii-115toLeu-I40, 
Ala-144toGlu-156, 
Val-159 to Ser-166, 
Arg-178toPro-186, 
Arg-191 to Ile-198. 




Ala-54 to Ile-59, 
His-71 to His-82. 


Met-5 to Asn-11, 
GIy-20 to Arg-30, 
Thr-36toIle-41, 
His- 136 to Thr-143. 
Thr-152 to Asp-161. 
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H0305: 1 


AR089: 6, AR061: 3 
H0497: 1,H0617: 1, 
L0769: 1,L0766: 1, 
L0775: 1, H0670: 1 and 
H0672: 1. 


AR061: 3.AR089: 2 
L0748: 8,H0212: 3, 
SOOlO: 2, L0438: 2, 
L0752: 2, H0170: 1, 
H0052: 1,H0596: 1. 
H0051: 1,H0032: 1, 
H0068: 1.L0800: 1. 
L0764: 1.L0803: 1, 
L0791: 1,H0521: 1, 
L0749: 1, L0758: 1, 
L0599: 1 and S0192: 1. 


AR089: 2.AR061: 0 
S0045: 1 and S0053: 1. 


Gln-70 to Glu-75. 


Lys-l9to Tlii-26. 


Asp-1 to Lys-12, 
Pro- 18 to Arg-26. 
Asp-51 to Val-74, 
Ala-80 to Leii-102. 




Glu-25 to Ser-30. 
Glu-57 to Thr-62, 
His-64 to.Ser-72. 
His-101 to Pro-106, 
Val-111 to Gin- 117. 


Tyr-7 toTyr-15, 
Pio-43 to Ala-52, 
Gln-57 to AIa-62, 
Asn-68toA'la-73, 
Tyr-75 to Met-83. 


Gln-38 to Ser-51. 
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Pro-1 to Glu-6, 
His- 17 to Lys-22, 
Pro-52 to Gln-58. 


Ala-2 to Gln-9. 
Arg-22 to Val-29, 
Glu-51 toLeu-64. 


His-8 to Gly-18, 
Glu-150 to Leu-167. 


Tyr-83 to Ser-92, 
Leu-118toTyr-123, 
Leu- 137 to Ser-143, 
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S0250: 1,H0212: 1, 
H0412: 1. S0144: 1. 
L0763: I, L0645: 1, 
L0764: 1,L0794: 1, 
L0766: 1, L0775: 1. 
L0783: 1.L0665: 1. 
HU5iy: 1, Hi)435: 1, 
H0672: 1,H0436: 1, 
S3014: 1.S0028: 1, 
L0750: 1,L0777: 1, 
L0366: 1,H0667; land 
H0423: 1. 
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AR061; 3.AR089: 1 
H0618:3and H0253: 
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AR061: 0,AR089: 0 
H0618:64. H0253: 52, 
L0758: 6, L0779: 2, 
H0392: 1,H0038: 1, 




His-12 to Arg-20, 
Pro-26 to Asp-43. 
Ala-62 to Gl 11-70, 
Arg-78 to Arg-83. 
Phe-100 to Gin- 105. 
Gly-129 to Glii-136. 


Tyr-52 to Gln-60. 
Phe-86 to Ala-94, 
Lys-111 to Arg-118. 
His- 193 toTyr-198. 


Pro-3 to Gly-8, 
Val-21 to Gly-30, 
Gly-68 to Ala-85, 
His-94 to Gly-99, 
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S0002: 1,L0631: 1, 
L0769: 1,L0372: 1, 
L0764: 1.L0768: 1. 
L0803: 1.L0783: 1, 
L0545: 1,L0791: 1, 
L0664: 1,L0665: 1, 
H0144- 1 1 0438- 1 
H0689: 1,S0380: 1, 
S0013: 1,H0696: 1, 
H0555: 1,L0743: 1, 
L0744: 1,L0747: 1, 
L0731: 1,L0759: 1, 
L0596: land L0604: 1. 


AR061: 6,AR089: 2 
L0755: 5, H0212: 2, 
L0439: 2. L0754: 2, 
H0393: 1. H0409: 1, 
L0764:'l,L0662: 1. 
L0803: 1, L0382: 1. 
L0666- 1 L0438- 1 
L0749: 1 and L0752: 1. 


AR089: 1.AR061: 0 
L0747: 5, L0439: 3, 
L0756: 3, L0775: 2. 




Gly-43 to Tyr-50. 






Asn-36 to Gly-43, 
Gly-66toGlii-73, 
Ser-86 to Pro-92, 
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139190. 
139190, 
224100, 
600281, 
600281, 
601002, 
601002, 
601146. 
601 146. 
601146 










cr 
o 




AR089: 64,AR061: 15 
H0542: 2 


AR089; 3,AR061: I 
H0328: 1,'L0758: 1 
andH0543:l. 


AR089: 7,AR061: 1 
L0589: 1,H0542: 1 
and H0543: 1. 


AR089: 68,AR061: 29 


Glu-49toAsn-59, 
Lys-70 to Lys-82, 
Gly-99to Cys-116. 


Glu-4 to Leu-11, 
Gln-30toCys-40, 
Pro-53 to Pro-59, 
Thr-99 to Ser-104. 


Met-22 to Trp-27. 


Gin- 13 to Ile-29. 


Gln-13to Ile-29. 


Gln-1 toThr-6. 
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L0731: 1.S0031: 1, 
L0596: 1,L0608: 1 and 
SOOll: 1. 


AR061: 5,AR089: 4 
L0748:2,H0191: I, 
HO 100: 1,S0002: 1, 
L0646: 1,L0794: 1, 
L0806: landL0758: 1. 


AR061: 5,AR089: 3 
L0766: 2, H0575: 1, 
H0179: 1,H0416: 1, 
H0539: land L0592: 1. 


AR089: 3,AR061: 3 
S0282: 1 and S0053: 1. 


AR061: 6.AR089: 3 
H0615:2andL0766: 
1. 




Phe-3 to Phe-8, 
Pro-30 to Glu-36, 
Pro-93 to Arg-98. 


Ser-41 to Glu-47. 
Arg-71 toLeu-85, 
Asp-87 to Glu-97. 


Glu-15 to His-24, 
Asn-47 to His-53. 


Gln-1 to Gly-7, 
Ser-63 to Gly-68. 
Pro-74 to Cys-81. 


Glu-1 to Glu-22. 


Asp-29 to Arg-35, 
Leu-58 to Thr-64. 


Asp-29 to Arg-35, 
Leu-58 to Thr-64. 


Ile-34 to Gly-42. 
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H0085: 1,H0235: 1, 
H0545: 1,H0012: 1, 
H0644: 1. HO 124: 1, 
H0634: 1.H0494: 1, 
S0144: 1,S0I42: 1, 
L0638: l,L0642: 1, 
r 0764- 1 1 077 1 ■ 1 

L^\J t \J^. 1 , L-d\J / f I . If 

L0773: 1,L0768: 1, 
L0649: 1,L0774: 1, 
L0775: 1,L0651: 1. 
L0653: 1,L0776: 1, 
L0659: 1,L0809: 1. 
S0374: 1,H0690: 1, 
H0522: 1,H0696: 1, 
L0740; 1,L0754: 1, 
L0755: 1,L0731; 1, 
L0757: 1,H0707: 1, 
L0601: 1 andH0543: 1. 


AR054: 26,AR051: 
12,AR050: 10,AR061: 
7.AR089: 4 
H0556: 4. L0770: 4. 
L0794: 4, L0758: 4, 
L0731: 3,H0038: 2. 
L0766- 2 L0659: 2 
S0212: 1,S0132: 1, 




Arg-41 toThi-53, 
Ser-89 to Glu-95, 
Leii-109 to Lys-114, 
Pro-189 toGlu-194. 
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S0136: 3, L0779: 3. 
H0171: 1,H0052: 1, 
H0038: 1,L0766: 1, 
H0547: 1,S0031: 1 aad 
S0242: 1. 


AR089: 26, AR061: 4 
S0354: 1,H0561: I and 
L0603: 1. 


AR061: 49,AR089: 19 


AR061: 2,AR089: 1 
L0766: 2, SOOOl: 1, 
H0592: 1,H0575: 1, 
H0644: 1.H0038: 1 and 
H0144: 1. 


AR089: 1,AR061: 0 
H0638:2, H0521: 2, 
L0752: 2, H0677: 2, 
H0650: 1,H0484: 1. 
H0458: 1.H0580: 1. 
H0586: 1,H0575: 1, 
H0081; 1,S0036: 1, 
H0063: l,H0560: 1, 
L0809: 1, SO 126: 1. 
S0328: 1,L0744: 1, 
L0740: 1,L0754: 1 and 




Ile-44 to Gln-50. 


Asp-1 to Pro-12. 


Glu-18to Thr-23. 
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L0163: 4, L0777: 4, 
L0520; 3, H0002: 2. 
S0364: 2, S0330: 2, 
L0747:2, H0171: 1, 
H0549: 1.H0486: 1, 
H0013: 1,H025,3: 1, 
H0318- 1 S0049' 1 
H0251: 1,L0471: 1. 
S0051: 1,H0616: 1, 
S0038: I, HO 100: 1, 
H0561: 1.L0803: 1, 
L0782: 1,L0809: 1, 
L0779: l,L0759: 1 and 
L0584: 1. 




His-lOto Gly-16, 
Pro-65 to Ala-70, 
Ala-96 to Lys-lOl. 
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52-705 








975280 
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[51] The first column in Table lA provides the gene number in the application 
corresponding to the clone identifier. The second column in Table lA provides a unique 
"Clone ID NO:Z" for a cDNA clone related to each contig sequence disclosed in Table 1 A. 
This clone ID references the cDNA clone which contains at least the 5' most sequence of the 
assembled contig and at least a portion of SEQ ID NO:X was determined by directly 
sequencing the referenced clone. The reference clone may have more sequence than 
described in the sequence listing or the clone may have less. In the vast majority of cases, 
however, the clone is believed to encode a full-length polypeptide. In the case where a clone 
is not full-length, a full-length cDNA can be obtained by methods described elsewhere 
herein. 

[52] The third column in Table lA provides a unique "Contig ID" identification for 
each contig sequence. The fourth column provides the "SEQ ID NO:" identifier for each of 
the contig polynucleotide sequences disclosed in Table 1 A. The fifth column, "ORE (From- 
To)", provides the location (i.e., nucleotide position numbers) within the polynucleotide 
sequence "SEQ ID NO:X" that delineate the preferred open reading frame (QRF) shown in 
the sequence listing and referenced in Table 1 A, column 6, as SEQ ID NO:Y. Where the 
nucleotide position number "To" is lower than the nucleotide position number "From", the 
preferred ORF is the reverse complement of the referenced polynucleotide sequence. 
[53] The sixth colunm in Table lA provides the corresponding SEQ ID NO:Y for the 
polypeptide sequence encoded by the preferred ORF delineated in column 5. In one 
embodiment, the invention provides an amino acid sequence comprising, or alternatively 
consisting of, a polypeptide encoded by the portion of SEQ ID NO:X delineated by "ORF 
(From-To)". Also provided are polynucleotides encoding such amino acid sequences and the 
complementary strand thereto. 

[54] Column 7 in Table lA lists residues comprising epitopes contained in the 
polypeptides encoded by the preferred ORF (SEQ ID NO:Y), as predicted using the 
algorithm of Jameson and Wolf, (1988) Comp. Appl. Biosci. 4:181-186. The Jameson-Wolf 
antigenic analysis was performed using the computer program PROTEAN (Version 3.1 1 for 
the Power Macintosh, DNASTAR, Inc., 1228 South Park Street Madison, WI). In specific 
embodiments, polypeptides of the invention comprise, or alternatively consist of, at least one, 
two, three, four, five or more of the predicted epitopes as described in Table lA. It will be 
appreciated that depending on the analytical criteria used to predict antigenic determinants, 
the exact address of the determinant may vary slightly. 
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[55] Column 8 in Table 1 A provides an expression profile and library code: count for 
each of the contig sequences (SEQ ID NO:X) disclosed in Table lA, which can routinely be 
combined with the information provided in Table 4 and used to determine the tissues, cells, 
and/or cell line libraries which predominantly express the polynucleotides of the invention. 
The first number in column 8 (preceding the colon), represents the tissue/cell source 
identifier code corresponding to the code and description provided in Table 4. For those 
identifier codes in which the first two letters are not "AR", the second number in column 8 
(following the colon) represents the number of times a sequence corresponding to the 
reference polynucleotide sequence was identified in the tissue/cell source. Those tissue/cell 
source identifier codes in which the first two letters are "AR" designate information 
generated using DNA array technology. Utilizing this technology, cDNAs were amplified by 
PGR and then transferred, in duplicate, onto the array. Gene expression was assayed through 
hybridization of first strand cDNA probes to the DNA array. cDNA probes were generated 
from total RNA extracted from a variety of different tissues and cell lines. Probe synthesis 
was performed in the presence of^^P dGTP, using ohgo(dT) to prime reverse transcription. 
After hybridization, high stringency washing conditions were employed to remove non- 
specific hybrids from the array. The remaining signal, emanating from each gene target, was 
measured using a Phosphorimager. Gene expression was reported as Phosphor Stimulating 
Luminescence (PSL) which reflects the level of phosphor signal generated from the probe 
hybridized to each of the gene targets represented on the array. A local background signal 
subtraction was performed before the- total signal generated from each array was used to 
normalize gene expression between the different hybridizations. The value presented after 
"[array code]:" represents the mean of the dupHcate values, following background subtraction 
and probe normalization. One of skill in the art could routinely use this information to 
identify normal and/or diseased tissue(s) which show a predominant expression pattern of the 
corresponding polynucleotide of the invention or to identify polynucleotides which show 
predominant and/or specific tissue and/or cell expression. ' 

[56] Column 9 in Table lA provides a chromosomal map location for certain 
polynucleotides of the invention. Chromosomal location was determined by finding exact 
matches to EST and cDNA sequences contained in the NCBI (National Center for 
Biotechnology Information) UniGene database. Each sequence in the UniGene database is 
assigned to a "cluster"; all of the ESTs, cDNAs, and STSs in a cluster are believed to be 
derived from a single gene. Chromosomal mapping data is often available for one or more 
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sequence(s) in a UniGene cluster; this data (if consistent) is then applied to the cluster as a 
whole. Thus, it is possible to infer the chromosomal location of a new polynucleotide 
sequence by determining its identity with a mapped UniGene cluster. 

[57] A modified version of the computer program BLASTN (Altshul et al., J. Mol. 
Biol. 215:403-410 (1990); and Gish and States, Nat. Genet. 3:266-272 (1993)) was used to 
search the UniGene database for EST or cDNA sequences that contain exact or near-exact 
matches to a polynucleotide sequence of the invention (the 'Query'). A sequence from the 
UniGene database (the 'Subject') was said to be an exact match if it contained a segment of 
50 nucleotides in length such that 48 of those nucleotides were in the same order as found in 
the Query sequence. If all of the matches that met this criteria were in the same UniGene 
cluster, and mapping data was available for this cluster, it is indicated in Table 1 A under the 
heading "Cytologic Band". Where a cluster had been further localized to a distinct cytologic 
band, that band is disclosed; where no banding information was available, but the gene had 
been localized to a single chromosome, the chromosome is disclosed. 

[58] Once a presumptive chromosomal location was determined for a polynucleotide of 
the invention, an associated disease locus was identified by comparison with a database of 
diseases which have been' experimentally associated with genetic loci. The database used was 
the Morbid Map, derived from OMIM™ (supra). If the putative chromosomal location of a 
polynucleotide of the invention (Query sequence) was associated with a disease in the 
Morbid Map database, an OMIM reference identification number was noted in column 10, 
Table lA, labelled "OMIM Disease Reference(s)". Table 5 is a key to the OMIM reference 
identification numbers (column 1), and provides a description of the associated disease in 
Colunan 2. 
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TABLE IB 



Clone ID 
NO:Z 


SEQ ID 
NO:X 


CONTIG 
ID: 


BAC ID: A 


SEQ ID 
NO:B 


EXON 
From-To 


HFCBB56 : 


24 


910073 


AC068296 


1268 


1-225 


HIBBF63 


75 


912715 • 


AC009065 


1269 


1-70 
850-1112 
1169-1622 
1707-1779 
1874-1924 
2836-2908 
3006-4160 


HIBBF63 


75 


912715 


AC012171 


1270 


1-64 
159-209 
1122-1194 
1292-1527 
1593-2446 


HIBBF63 


75 


912715 


AC005346 


1271 


1-70 
874-1 136 
1193-1646 
1731-1803 
1898-1948 
2861-2933 
3031-4185 


H1BBF63 


75 


912715 


AC009065 


1272 


1-547 


HIBBF63 


75 


912715 


AC012171 


1273 


1 -547 


H1BBF63 


75 


912715 


AC009065 


1274 


1-424 


HIBBF63 


75 


912715 


AC005346 


1275 


1-547 


H1BBF63 


75 


912715 


AC012171 


1276 


1-419 


H1BBF63 


75 


912715 


AC005346 


1277 


1-424 


H2CBH45 


90 


963811 


AC068243 


1278 


1-267 
1540-1640 
3095-3380 
3393-3556 
3901-3967 
4137-4639 
5287-5856 
SQ1 6-6588 
7029-7876 
8324-8414 


H2CBH45 


90 


963811 


AC068243 


1279 . 


1-309 


HBGQT03 


93 


908173 


AC024045 


1280 


1-218 
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457-549 












660-819 












2039-2238 












2529-2763 












2876-3033 , 












3631-3810 












3941^58 












4184-4322 












4727-4851 












5161-6181 


HBGQT03 


93 


908173 


AC024045 


1281 


1-176 


HBGQT03 


93 


908173 


AC024045 


1282 


1-461 
960-1030 
1194-1959 
2041-2516 
3037-3122 
3396-3455 
4055-4366 
4547-4599 
4967-5216 
5321-5461 
6521-7174 
7564-7841 
8311-8758 
8829-8969 
8997-10118 
10257-10910 
12058-12385 
12438-12953 
13729-13873 


HCEPH71 


97 


522739 


AL365319 


1283 


1-494 


HCEPH71 


97 


522739 


AL390715 


1284 


1-494 


HCOOZl 1 


100 


965306 


AL022238 


1285 


1-121 
899-983 
1445-1513 
2166-3430 
3550-3763 

4449-4595 
4960-5152 
5385-5529 
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574^-5972 | 
6327-7067 
7097-7152 
7210-8073 
8079-8680 
8772-11399 
12956-13517 
13736-14155 
14311-14753 
16294-16357 
16648-16806 
16874-17059 
17685-17787 


HCOOZll 


100 


965306 


AL022238 


1286 


1-540 


HCOOZll 


100 


965306 


AL022238 


1287 


1-665 


HCWFF88 


101 


506577 


AC025670 


1288 


1-300 


HCWFF88 


101 


506577 


AL157951 


1289 


1-624 


HCWFF88 


101 


506577 


AL157951 


1290 


1-409 


HCWFF88 


101 


506577 


AL157951 


1291 


1-83 


HDPFF24 


104 


909232 


AC020910 


1292 


1-353 
359-468 
787-861 
1877-2199 
4963-5089 
5342-5440 
6133-8734 
9933-10319 


HDPFF24 


104 


1 909232 


AC020910 


1293 


1-814 


HDPFF24 


104 


909232 


AC020910 


1294 


1-437 


HDTKQ14 


107 


886936 ■ 


AL359542 


1295 


1-140 
1249-4264 


HPTKQ14 


107 


886936 


AL023653 


1296 


1-140 
1249-4264 


HDTKQ14 


107 


886936 


AL359542 


1297 


1-499 


HDTKQ14 


107 


886936 


AL359542 


1298 


1-145 


HDTKQ14 


107 


886936 


AL023653 


1299 


1-499 


HFTDF15 


113 


657020 


AL365277 


1300 


1-406 


HFTDF15 


113 


657020 


AC0245 1 1 


1 irv 1 
liUl 


1-406 


HFTDFiS 


113 


657020 


AL365277 


1302 


1-430 


HFTDFIS 


113 


657020 


AC024511 


1303 


1-430 


HFTDFIS 


113 


657020 


AL365277 


1304 


1-526 
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HFTDF15 


113 < 


557020 


^C024511 


1305 


1-526 


HLQDT35 


117' 


839777 


^CO 10998 


1306 


' 1-44 










540-884 












1203-1261 












1994-2178 












2303-2474 












2991-3088 












3592-3757 












4262-4364 












4742-5802 












6235-7057 












7126-8472 


HLQDT35 


117 


839777 


AC013357 


1307 


1-44 










540-884 












1203-1261 








■ 




1994-2178 












2303-2474 












2991-3088 












3592-3757 












4262-4364 












4742-5802 












6235-7057 












7126-8472 


HLQDT35 


117 


839777 


AGO 10998 


1308 


. 1-768 


HLQDT35 


117 


839777 


AC013357 


13.09 


1-6035 










8430-11057 


HLQDT35 


117 


839777 


AGO 10998 


1310 


1-278 


HLQDT35 


117 


839777 


AG013357 


1311 


1-278 


HLWFN63 


118 


908437 


AC006599 


1312 


1-30 












1525-1711 












5428-5502 












7038-7273 












7590-7735 












8960-9049 












11665-11800 












12889-13194 












13907-14119 












14889-15043 












15926-16164 












18759-19079 












20581-20693 
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22531-22783 
23817-24956 
26153-26283 
26791-27141 
28145-29220 


HLWFN63 


118 


908437 


AL033378 


1313 


1-30 
1525-1711 
5428-5502 
7038-7261 
7590-7735 
11665-11800 
12889-13194 
13907-14119 
14889-15043 
15926-16164 
18759-19079 
20581-20693 
22531-22753 
23817-24956 
26153-26283 
26791-27141 
28145-29220 


HLWFN63 


118 


908437 


AC006599 


1314 


1-2939 


HLWFN63 


118 


908437 


AL033378 


1315 


1-2939 


HMSCD15 


120 


918133 


AC027008 


1316 


1-1190 


HMSCD15 


120 


918133 


AL158207 


1317 


1-130 
923-1252 
1765-3269 
4138-4483 
6546-7734 


HMSCD15 


120 


918133 


AL158207 


1318 


1-371 


HPMFL08 


128 


959569 


Z93016 


1319 


1-477 


HPMFL08 


128 


959569 


Z93016 


1320 


1-650 


HTEAG49 


135 


954614 


AL390796 


1321 


1-1310 


HTEAG49 


135 


954614 


AL357045 


1322 


1-1310 


HTEAG49 


135 


954614 


AL390796 


1323 


1-627 


HTEAG49 


135 


954614 


AL357045 


1324 


1-627 


HTLBH67 


136 


751985 


AC008439 


1325 


1-62 
293-400 
452-976 
1016-1058 
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1463-1534 
1886-2026 
2110-2249 
2401-2463 
3324-4027 
4192-4288 
4694-5330 
5485-5650 
5813-6262 
6273-6401 
6475-6559 
6728-6847 
6979-7205 
7573-7676 
7730-8146 
8334-8866 
8885^9392 


HTLBH67 


136 


751985 


AC008781 


1326 


1-85 












254-371 












505-731 












1098-1201 












1255-1671 












1718-2387 












2408-2915 












3113-3244 












. 3382-4278 












4504-4538 












4650-5645 


HTLBH67 


136 


751985 


ACO'22420 


1327 


1-62 
295-403 
455-979 
1019-1061 
1466-1537 
1890-2030 
2114-2253 
2405-2467 
3328-4030 

A 1 /too 1 

4iy!)-4zyi 
4697-5333 
5488-5653 
5816-6265 
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6276-6404 
6478-6562 
6731-6850 
6982-7208 
7575-7678 
7732-8148 
8195-8864 
8885-9392 
9590-9721 
9859-10754 
10980-11014 
11126-12121 


HTLBH67 


136 


751985 


AC005368 


1328 


1-64 
294-399 
451-975 
1015-1057 
1462-1533 
1885-2025 
2109-2248 
2400-2462 
3323-4026 
4191-4287 
4693-5329 
5484-5649 
5812-6264 
6275-6403 
6477-6561 
6730-6849 
6981-7207 
7575-7678 
7732-8148 
8201-8868 
8887-9394 
9592-9723 
9861-10759 
10985-11019 
11131-12126 


HTLBH67 


136 


751985 


AC008781 


1329 


1-292 


HTLBH67 


136 


751985 


AC022420 


1330 


1-323 
1372-1431 
1657-1821 
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2377-2485 
4488-4700 
4954-5061 
6224-6547 
6819-6965 
7268-7333 
8088-8593 
9897-10068 
10109-10623 
10645-10680 
10812-10871 
10982-11123 
11345-11383 
11877-12000 
12310-13467 


HTLBH67 


136 


751985 


AC022420 


1331 


1-292 


HTLBH67 


136 


751985 


AC005368 


1332 


1-292 


HTLJC71 


137 


922923 


AC0O9516 


1333 


1-2009 


HTUC71 


137 


922923 


AC007957 


1334 


1-1747 


HTUC71 


137 


922923 


AGO 18751 


1335 


1-2009 


HTUC71 


137 


922923 


AC023490 


1336 


1-2009 


HTUC71 


137 


922923 


AC009516 


1337 


1-375 


HTLJC71 


137 


922923 


AC009516 


1338 


1-494 


HTUC71 


137 


922923 


AC007957 


1339 


1-205 


HTUC71 


137 


922923 


AGO 18751 


1340 


1-494 


HTUC71 


137 


922923 


AC023490 


1341 


1-375 


HTUC71 


137 


922923 


AGO 18751 


1542 


1-375 


HTPAD46 


138 


503313 


AGO 10932 


1343 


1-3347 


HTPAD46 


138 


503313 


AL133510 


1344 


1-5377 


HWMBM13 


144 


909683 


AL158847 


1345 


1-1445 
1668-1817 
1931-2643 


HWMBM13 


144 


909683 


ALl 58847 


1346 


1-396 


HWWDN34 


145 


911357 


AGO 192 14 


1347 


1-160 
713-910 
1069-1269 
3997-4098 
4303-4397 
5035-5098 
5740-5796 
6024-6155 
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6697-6813 
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7491-7586 












7922-8003 












8015-8421 












8432-8624 












8714-8856 












8943-10332 












10482-10901 












1 154 /-1 1^34 












1 T 1 1 n 1 1 77 
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1 '^'^ 10-1417^ 


H1BEC79 


383 


703000 


ACUl 14D(5 


1 /17^ 




H1BEC79 


383 


703000 


ACOl 1458 


14/d 


1 9R7 


HNFHS82 


387 


779946 


AC010835 


1477 


1 _A1 52 


HFPBB28 


389 


844526 


AC016135 


1478 


1 RA^ 


HFPBB28 


389 


844526 




1 A7Q 


1-776 


HFPBB28 


389 






14.80 


1-240 


HFPBB28 


389 


844526 


AC002518 


1481 


1-150 


HDQGZ78 


399 


909735 


AC026282 


1482 


1-238 












976-1440 
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2143-2356 












6769-6910 












9591-9648 












9951-10098 


HSIDQ38 


401 


909854 


AC003070 


1483 


1-152 
3039-3473 
4301-4483 
4678-4795 
5280-5944 
6055-6117 
6290-6359 
6677-6761 
8475-9284 . 
11404-11918 
12112-12437 
12443-13065 
13153-13467 
13593-13719 
13799-14185 
14224-16489 


HFTBL33 


407 


910055 


AC022366 


1484 


1-565 
1503-1718 
1838-1933 
2011-2097 
2265-2335 
2588-2693 
2905-2975 
3090-3726 
3809-3889 
4080-4591 
4847-5070 
5355-5819 


HFTBL33 


407 


910055 


AC025165 


1485 


1-565 
1503-1718 
1838-1933 
2011-2097 
2265-2335 
2588-2693 
2905-2975 
3090-3726 
3809-3889 
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4080-4591 
4847-5070 
5355-5819 


HFrBL33 


407 


910055 


AC025165 


1486 


1-604 


HUFCI64 


411 


911558 


AC004151 


1487 


1-145 
359-443 
527-599 
798-868 
958-1095 
1196-1260 
1465-1577 
1652-1732 
2256-3158 
4031-4899 
4984-5306 
5735-6066 
6554-6694 
6780-6970 
7107-7232 
7316-7404 
7529-7643 
7744-7917 
8401-8592 
8675-8813 
9685-9920 
9958-10211 
,10485-11014 
11088-11199 
11958-15576 
16324-16465 
16587-16818 
16939-17000 
17440-17554 
17558-17946 
18645-18765 
19015-19378 
20522-20937 


HUFCI64 


411 


911558 


AC004151 


1488 


1-134 


HWAFT84 


412 


911559 


AC004151 


1489 


1-145 
359-443 
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527-599 
798-868 
958-1095 
1196-1260 
1465-1577 
1652-1732 
2256-3158 
4031-4899 
4984-5306 
5735-6066 
6554-6694 
6780-6970 
7107-7232 
7316-7404 
7529-7643 
7744-7917 
8401-8592 
8675-8813 
9685-9920 
9958-10211 
10485-11014 
11088-11199 
11958-15576 
16324-16465 
16587-16818 
16939-17000 
17440-17554 
17558-17946 
18645-18765 
19015-19378 
20522-20937 
22111-22452 


HWAFT84 


412 


911559 


AC004151 


1490 


1-134 


HWADR60 


416 


926487 


AC023176 


1491 


1-178 
293-506 
542-940 
1591-2005 
2031-2104 
2390-2509 
3681-3797 
4018-4165 
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4267-4381 
4704-4736 
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526487 


AC023176 


1492 


1-162 
443-739 
1067-1458 
1745-1877 
1976-2119 
2816-2883 
3171-3294 
3727-4154 
4340-4442 
5251-6126 
6708-7176 
7418-7880 
8134-8752 
9979-10164 
11234-11413 
12532-12666 
13313-13459 
14761-14898 
15208-15308 
16207-16518 


HPCIG66 
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AC024888 


1493 


1-36 
149-234 
537-623 
852-921 
1077-1728 
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AC024888 
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1-61 
133-210 
992-1107 
1310-1644 
1834-1905 
2133-2254 
2927-3032 
4154-4254 
4482-4683 


HPCIG66 


419 


930886 


AC024888 


1495 


1-63 

574-1064 
1763-2190 
2394-2604 
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3452-4040 
6187-6254 


HCRPU72 


420 


931140 


AC023151 


1496 


1-65. 
721-1042 


HE9RT95 


421 


934556 


AC008439 


1497 


1-57 
311-418 
1581-1904 
2176-2322 
2625-2690 
3445-3950 
5254-5425 
5466-5980 
6002-6037 
6169-6228 
6339-6400 
6701-6739 
7238-7349 
7664-8821 


HE9RT95 


421 


934556 


AC022420 


1498 


1-323 
1372-1431 
1657-1821 
2377-2485 
4488-4700 
4954-5061 
6224-6547 
6819-6965 
7268-7333 
8088-8593 
9897-10068 
10109-10623 
10645-10680 
10812-10871 
10982-1 1 123 
1 1345-1 13o3 
11877-12000 
12310-13467 


HE9RT95 


421 


934556 


AC022420 


1499 


1-389 


HE9RT95 


421 


934556 


AC022420 


1500 


■ 1-62 
295-403 
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455-979 
1019-1061 
1466-1537 
1890-2030 
2114-2253 
2405-2467 
3328-4030 
4195-4291 
4697-5333 
5488-5653 
5816-6265 
6276-6404 
6478-6562 
6731-6850 
6982-7208 
7575-7678 
7732-8148 
8195-8864 
8885-9392 
9590-9721 
9859-10754 
10980-11014 
11126-12121 


HWADD57 


426 


943039 


AGO 11492 


1501 


1-303 
949-1648 
1913-2937 
3032-3231 
3325-3443 
4093-4485 
4777-4936 
5057-5548 
5650-5968 


HWADD57 


426 


943039 


AGO 11492 


1502 


1-50 
852-907 
988-1407 
1584-1839 
2455-2586 
2689-2787 


HFKHR40 


431 


952470 


AGO 18805 


1503 


1-525 
612-1372 
1476-1730 
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1732-2155 

2345-2460 

2652-3025 

3157-3251 

3449-3540 

3680-3780 

3914-4131 

4215-4491 

4603-4741 

4913-4987 

5135-5190 

5435-5571 

5901-6011 

6309-6423 

6922-8294 

8370-8522 


HFKHR40 


431 


952470 


AC061707 


1504 


1-527 
614-1374 
1478-1732 
1734-2158 
2348-2463 
2655-3027 
3159-3253 
3451-3542 
3682-3782 
3916-4134 
4219-4495 
4607-4745 
4917-4991 
5139-5194 
5439-5575 
5905-6015 
6313-6427 
6926-9300 
9919-9960 
10029-10186 
11393-11624 
12094-12294 
13227-13375 
13690-13829 
13921-14010 
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14362-14486 


HFKHR40 


431 


952470 


ACO 18805 


1505 


' 1-343 












700-770 


HFKHR40 


431 


952470 


AC061707 


1506 


1-343 












700-771 
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952470 
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1-277 
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963422 


ACO 10545 
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1-40 












1661-1891 












2119-2199 












5160-5349 












6239-6607 












7675-8566 












9450-9516 












9675-9752 












10110-10274 












14154-15055 












16384-16500 












17055-17139 












19941-20453 






• 






20703-21216 












21806-21945 












23638-24171 












24527-24795 












25564-25656 












26644-26787 












27284-27438 












28354-28612 












29247-29591 












29597-30208 












32018-32539 












33187-33942 


HWLHFIO 


435 


963422 


ACO 10545 


1509 


1-721 


HWLHFIO 


435 


963422 


AC010545 


1510 


1-610 












675-1454 












1591-2267 












2801-3363 
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[59] Table IB summarizes additional polynucleotides encompassed by the invention 
(including cDNA clones related to the sequences (Clone ID NO:Z), contig sequences (contig 
identifier (Contig ID:) contig nucleotide sequence identifiers (SEQ ID NO:X)), and genomic 
sequences (SEQ ID NO:B). The first column provides a unique clone identifier, "Clone ID 
NO:Z", for a cDNA clone related to each contig sequence. The second column provides the 
sequence identifier, "SEQ ID NO:X", for each contig sequence. The third column provides a 
unique contig identifier, "Contig ID:" for each contig sequence. The fourth column, provides 
a BAC identifier "BAC ID NO:A" for the BAC clone referenced in the corresponding row of 
the table. The fifth column provides the nucleotide sequence identifier, "SEQ ID NO:B" for 
a fragment of the BAC clone identified in column four of the corresponding row of the table. 
The sixth column, "Exon From-To", provides the location (i.e., nucleotide position numbers) 
within the polynucleotide sequence of SEQ ID NO:B which delineate certain polynucleotides 
of the invention that are also exemplary members of polynucleotide sequences that encode 
polypeptides of the invention (e.g., polypeptides containing amino acid sequences encoded 
by the polynucleotide sequences delineated in column six, and fragments and variants 
thereof). 
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